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HE LIKES OR DISLIKES EACH DRAWING. THIS NON-LANGUAGE, 

OBJECTIVE METHOD OF ASSESSING PERSONALITY REVEALS TWO MAJOR 
factors— (1) THE TENDENCY TO LIKE OR DISLIKE FIGURES, AND (2) 
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The Welsh Figure Preference Test, published by Consid- 
ting Psychologists Press, consists of 400 black and white 
line drawings of varying complexity. In its original form, 
tne drawings on the test were presented individually on 
cards, but the subsequent booklet form, consisting of eight 
drawings per page, has been used more extensively. The 
examinee Is Instructed to indicate whether he likes or doesn»t 
like each of the drawings, either by marking an ”L” or a ”D” 
cn the separate answer sheet, a task which takes about 40 

Preliminary Manual for the test (Welsh, 
1959/ states that it can be used with persons of age six 
^d above and as many as 2? different scores obtained: 

DL (Don't Like Scale), RP (Repeat Scale), CP (Conformance 
Scale), BW (Barron-Welsh Art Scale), RA (Revised Art Scale). 

(Neuropsychiatric Scale), CN 

(Children Scale), in addition to 12 a priori content scores, 
an MV (Movement Scale) score, an PG (Figure -Ground Scale) 
score, and five sex symbol scores, 

Welsh's purpose in constructing the test was to provide 
a non-language , objective method of assessing personality. 

In a sense, the Welsh Figure Preference Test (WPPT) was to 
be a non-language form of the MMPI, a test which had proven 
so successful with literates. Initially (Welsh. 1949), 

143 people, six of whom were artists, sorted several hundred 
line drawings , drawn in black ink on 3 by 5 cards , into two 
piles— those that they liked and those that they disliked. 
After these sorts were obtal^ied, a factor analysis of the 
preferences revealed two major factors: (I) the tendency to 
like or dislike figures (acceptance-re jectlon factor) , and 
(II) the tendency to prefer simple, symmetrical figures as 
opposed to. complex, asymmetrical ones. The second bipolar 
factor was orthogonal to the first. 

Further study of the responses to these designs (Welsh, 
1949 ) indicated that individuals who preferred the simple, 
symmetrical figures tended to be conservative and conven- 
tional, whereas those who preferred the complex, asymmetrical 
designs tended to be more antisocial or rebellious. Psychol- 
ogists' ratings of these two groups of individuals depicted 
the first group as more enthusiastic, optimistic, conserva- 
tive, organi zed, and conventional, whereas those in the 

^The writing of this review was supported by the 
Creativity Research Institute of the Richardson Foundation, 
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second group were described as more cynical, pessimistic, 
depreclatlve, overtly hostile, and socially dissident. 



Reviews and Criticisms 

The Sixth Mental Measurements Yearbook (Buros, I965) 
contains two reviews of the WPPT. f he review by Gordon 
Anderson of the earlier card form of the test appeared first 
In the Journal of Consulting Psychology (i960) and Is con- 
cerned only wlth^some of the Initial research on the test. 
Anderson refers to the test's ultimate Intent of providing 
an Index of emotional adjustment and Identifying .and quantl- 
fylng personality characteristics. He then proceeds to 
describe the various scales, concluding that the test Is 
primarily a research tool In need of further validation work. 

In his review of the booklet form, Harold Borko makes 
a number of criticisms of the WPPT. After summarizing 
reliability coefficients obtained for various scales— .94 
and .90 for the HA Scale, .64 and .?4 for the MV Scale, and 
.92 to .51 for the BL Scale— Borko concludes that adequate 
reliability studies have not been done. In addition. It Is 
alleged that the standardizing populations were too small— 

100 male VA patients, 75 normal adult males, 75 normal adiULt 
females, and 42 boys and 40 girls aged six to eight. .Although 
Borko refers to Welsh's (1959) statement that the Barron- 
Welsh Art Scale Is In final form. It Is also noted that the 
Revised Art Scale Is not In final form, the Male-Female Scale 
has not held up under cross-validation, the Children Scale 
has not been validated, and the Neuropsychiatric Scale has 
not been cross-validated. Borko concludes that "the test 
does not measure many of the personality directions," and 
even those that are measured by the test "... are not meas- 
ured with much confidence" (p. I98). He recommends further 
empirical research to determine which sets of Items differ- 
entiate among various groups, since he feels that the test 
presently "... has little practical value" (p. 198). 

Both reviews of the WPPT Indicate that although the 
test Items are relatively content-free and do differentiate 
between groups, they don't predict well In the Individual 
case. In addition, the WPPT scores reportedly do not have 
high correlations with ratings of actual behavior. 

The major portion of the present review will be con- 
cerned with a consideration of the scales on which the 
greatest amount of research has been done. We shall not 
discuss all of the 27 scales listed In the 1959 manual, 
since the majority of these have received little research 
attention. We shall not be limited, however, to discussing 
only a sample of these 27 scales, because several new scales 
have been devised since the manual was published. Finally, 
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studies concerned with the relationship of WPPT scores to 
other tests and research idilch has used the WPPT to test 
Berg*s "deTlatlon hypothesis” will be summarized briefly. 



Barron-Welsh Art Scale 

is proper to consider the Barron-Welsh Art Scale 
(BW Art Scale) first, because more resecupch has been conduc- 
ted with this scale and the Revised Art Scale than all other 
scales combined. The original form of the BW Art Scale 
consisted of 40 Items disliked by artists significantly more 
often than people in general and 25 Items liked by artists 
significantly more often. The 40 Items whlcIPthe artists 
disliked were of the simple-symmetrical type, and the major- 
ity of the 25 Items which they liked were of the complex- 
asymmetrical sort. It was found that total score on this 
65-ltem scale differentiated between artists and non-artists, 
a finding which was cross-validated on two other groups 
(Barron A Welsh, 1952), 



Reliability of the BW Art Scale 

Wrlghtsman (1964) Investigated the test-retest relia- 
bility of the BW Art Scale, Por a group of 75 college- 
student examinees who took the test twice over an average 
Interval of 166 days, the reliability coefficient for total 
score on the BW Art Scale was ,799* Purthermore, the reli- 
ability of the ”Don»t Like” Items on the scale was ,761, and 
the reliability of the ”Llke” Items was ,782, 

Relatio nships of BW Art Scale to Personality Characteristics 

In a discussion of the eax*ller work with the BW Art 
Scale, Barron (1953a) summarized the correlates of the scale 
with the Adjective Check List and an attitude scale. One 
significant finding was a positive correlation, for a group 
of male graduate students, between rated originality and 
BW Art Scale scores. It Is reported that preference for 
complexity (high scores) on the BW Art Scale was positively 
related to rapid personal tempo, verbal fluency,, impulsive- 
ness, expansiveness. Independence of judgment, originality, 
and breadth of Interest, but negatively related to tlgldlty, 
control of Impulse by repression, social conformity, ethno- 
centrlsm, and political-economic conservatism. 



In another experiment, using an Asch-type arrangement, 
Barron (1953b) found that low scorers on the BW Art Scale 
tended to “yield” to group pressure by giving perceptually 
Inaccurate reports, whereas high scorers tended to be more 
“Independent” in their judgments, giving perceptually accur- 
ate reports in spite of pressure from the group to do other- 
wise. 
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In his book, Creativity and Psychological Health (I 963 ), 
Barron again suiamarlzed reseaurch on the BW Art Scale, The 
following are some of the Investigations referred to In that 
book. One study (Barron, 1952) was concerned with the per- 
sonality characteristics of 40 male graduate students, some 
of whom had scored high and others low on the BW Art Scale, 

Both groups described themselves on Gough *s Adjective Check 
List, with low scorers on the BW Art Scale marking the 
adjectives ’’contented, gentle, conservative, patient, peace- 
able, serious, Indlvldiiallstlc; stable, worrying, thrifty, 
dreamy, deliberate, moderate, modest, responsible, foreslghted, 
and conscientious” more often, and high scorers on the BW 
Art Scale msurklng the adjectives ’’gloomy, pessimistic, bitter, 
dissatisfied, emotional, pleasure -seeking, unstable, cool. 
Irritable, aloof, sarcastic, spendthrift, distract Ible, 
demanding. Indifferent, anxious, opinionated, temperamental, 
and quick” more often, Ratings obtained from others revealed 
that high scorers on the BW Art Scale were perceived as being 
more original and artlsticedly expressive and as having 
better aesthetic Judgment than low scorers. The responses 
of the subjects to an attitude scale showed low scorers on 
the BVJ Art Scale to be more socially conforming, more respect- 
ful toward tradition, more patriotic, and more morally Judg- 
mental than high scorers on the art scale. 

In a related Investigation, Schultz and Knapp (1959) 
administered a 62 -card form of the BW Art Scale and a 100- 
Item self -description word list to 36 ? male, non-psychlatrlc 
VA patients. The subjects were divided Into fotur groups 
according to their scores on the BW Art Scale: (I) Like both 
symmetrical and asymmetrical designs, (II) Like symmetrical 
and dislike asymmetrical designs, (III) Dislike symmetrical 
and like asymmetrical designs, and (IV) Dislike both symme- 
trical and asymmetrical designs. Then these four groups 
were compared with respect to the adjectives which they 
checked most frequently as descriptive of themselves. The 
results were analyzed by comparing the responses of group 
II to those of groups I, III, and IV combined, and the res- 
ponses of group III to those of groups I, II, and IV combined. 
Group II, who preferred symmetrical to asymmetrical designs, 
more often described themselves as ’’always looking on the 
dark side, always on the go. Impatient, on edge, play It 
safe, push myself, self-conscious, talker, tired, and worn 
out,” Group III, who preferred asymmetrical to symmetrical 
designs, more often described themselves as ’’broke, different 
thsui others, forgetful, like to figure things out, nervous, 
never had the breaks, pretty clear about what I believe, 
rugged, talker, and thinker,” 

We shall refer briefly to three other Investigations 
concerned with the relationships between BW Art Scale scores 
and personality, Martin (1954) used a 40-ltem form of the 
BW Art Scale and correlated the scores with a measure of 
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intolerance of ambiguity. The latter measure consisted of 
counting the number of questions that the subjects asked In 
attempting to clarify an ambiguous situation. An essentially 
zero correlation was found between BW Art Scale scores and 
the latter measure, 

Secrest and Jackson (196I) administered a forced-choice 
form of the BW Art Scale to 60 first year nursing students 
by randomly pairing simple auid complex figures. The correc- 
ted split-half reliability of the test In this format was 
,92, A correlation of ,35. significant at the ,05 level, 
was found between scores on this form of the art scale and 
reputed ^social Intelligence” for these subjects. Finally, 
Pine (1962), In a study employing 12 male students, found a 
correlation of between scores on the BW Art Scale and a 
measure of ^adaptive regression”, and a correlation of ,64 
between art scale scores and a measure entitled "control of 
primary process”. 

Preferences for Paintings 

Barron (1952) obtained the preferences of 40 graduate 
students for 105 postcard-size reproductions of paintings 
by famous European artists and related these preferences to 
scores on the BW Art Scale, It was found that low scorers 
on the BW Art Scale, who liked symmetrical designs, preferred 
paintings depicting hlgh-bom and holy people-^palntings 
concerned with religion, authority, and aristocracy. This 
group disliked paintings of women of low birth and occupation 
and radically experimental, unnatural, or esoteric paintings. 
The reverse wsts true of students idio made high scores on 
the BW Art Scale, and consequently preferred asymmetrical 
designs. This second group approved of the experimental, 
modem, primitive, and sensual paintings, and disliked reli- 
gious, traditional, aristocratic, and emotionally controlled 
paintings. 

Another Investigation along these same lines (Child, 
'’9o2) was concerned primarily with the relationship between 
the ratings of the aesthetic values of paintings by 22 
undergraduate males and expert Judges* ratings of the aesthe- 
tic values of the paintings. Although the students and the 
experts* Judgments showed very little relationship, there 
was a correlation of ,45 between BW Art Scale scores euid 
the preferences of the students for paintings, scored accor- 
ding to a scale of aesthetic value established from the 
experts* Judgments, 




Studies of Artists. Architects, and Scientists 

Hosen (1955) administered the JW Art Scale to 44 art 
students, eight art faculty members, and a group of non- " 
ai’tlsts matched with the artists on age and sex. The mean 
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BW Scale score of the advanced art students was 40.7, the 
mean of the beginning art students 39.0, the mean of the 
art faculty ^1.1, and the meeui of the non-artists 22.1, 

Thus, there were only small differences between the metas of 
the art students and art facility but a large difference 
between the meeins of the non-artist and artist groups. 

Another finding was that course grades of the art students 
correlated .34 with BW Art Scale scores, and ratings of the 
students* originality correlated .40 with art scale scores. 

Some of the results of MacKlnnon*s (I96la) study of 
architects and research scientists are presented in Table 1. 
Prom this table It may be seen that the mean BW Art Scale 
score of 40 '’creative” architects was 37.1, whereas the means 
of the two less creative architect groups were 29.5 and 
26.1. The mean of 15 "creative” scientists was 30.7, and 
the means for the less creative scientist groups were 22.1 
and 19o2. Thus, mean scores on the BW Art Scale differen- 
tiated between subsamples of architects and research 
scientists differing in level of creativity. MacKinnon 
(1961a) reports a correlation of .48 between rated creativity 
and scores on the BW Art Scale for these groups. 

Studies of Other Occupational Groups 

Additional support for the hypothesis of a positive 
relationship between creativity and scores on the BW Art 
Scale was obtained by Barron (1955, 1957) in an assessment 
program which employed 100 Air Force captains as subjects. 
Eight tests, or measures, of originality (Unusual Uses, 
Consequences, Plot Titles, Rorschach 0+, TAT: Originality, 
Anagrams, Word Synthesis Originality, and Inkblot Originality) 
were administered to each subject, and the sum of the stan- 
dard scores on these tests was related to several other vari- 
ables. The investigation focused on two groups— those scoring 
high on the eight-test composite (15 Ss, "Orlglnfid” group) 
and those scoring low on the eight-test composite (15 Ss, 
"Unoriginal” group). On the BW Art Scale, the mean of the 
Originals was 19.40 suid the mean of the Unorlglnals 12.67, 
a significant difference. In a further analysis of the data, 
however, Barron (1957) found no positive correlation between 
the BW Art Scale and the eight-test originality composite 
when scores on Terman's Concept Mastery Test were partlalled 
out. 



MacKinnon (1961b) summarizes data relating BW Art Scale 
scores to creativity in the occupational groups referred to 
above and others, and some of these findings are also presen- 
ted In Table 1. For example, the mean BW Art Scale score of 
a ssunple of 16 "creative” women mathematicians was 28.1, and 
the mean of 28 "representative” women mathematicians 26.9. 
Hlghly-creatlve industrial researchers scored a mean of 30.7, 
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Table 1 



Mean Scores on the Barron-Welsh Art Scale for Various Groups 



Sample 


Number 

of 

Subjects 


Mean 


Standard 

Deviation 


Artists (standardization group)® 


80 


40.3 


12.9 


Artists (first cross-validation 
group) 


30 


39.1 


13.8 


Architects I (’’creative” 
architects) 


40 


37.1 


9.8 


Writers (’’creative”) 


19 


32.9 


11.1 


Team members of first American 
expedition to attempt Mount 
Everest 


15 


31,5 


12.1 


Hesearch scientists I (’’creative” 
scientists” 


15 


30.7 


6.3 


Architects II (control sample for 
number 3 above) 


43 


29.5 


10.1 


Women mathematicians I (’’creative” 


) 16 


28.1 


12.5 


Women mathematicians II 
( ’’representative” ) 


28 


26.9 


15.4 


Men mathematicians I (’’creative”) 


26 


26.9 


12.7 


Architects III (”representatlve” 
American architects” 


41 


26.1 


12.1 


Hesearch scientists II (’’less 
creative”) 


15 


22.1 


14.1 


Men mathematicians (’’less 
creative”) 


21 


19.4 


10.1 


Scientists III (’’least creative”) 


15 


19.2 


8.7 


Unselected adult males 


343 


13.9 


11.2 



Note.- After Barron, 1963b, p. 3. 

®The corrected means on the revised, 62-ltem form of 
the BW Art Scale are slightly lower than these. 
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whereas the mean of industrial researchers of lower creatl* 
vity was only 19.2. It Is Interesting to note that the mean 
score on the BW Art Scale of a group of student engineers was 
only 21. 5 t a fact which MacKinnon Interprets as Indicating 
that student engineer::^ need stimulation of their aesthetic 
Interests if they are to become creative . 

Finally, Gough (I96I) obtained a correlation of .41 
between scores ca the BW Art Scale and ratings of creativity 
In 45 physical science and engineering researchers. 

Relationships to Other Tests 



In this section we shall complete our review of corre- 
lational studies on the BW Art Scale by summarizing briefly 
the results of two Investigations. Blerl, Bradburn, and 
Gallnsky (1958) administered the BW Art Scale and the Embed- 
ded Figures Test to groups of 62 female and 50 male under- 
graduates. In the case of the males there was a significant 
negative correlation (r - -.36, p<.0l) between speed on the 
Embedded Figures Teat and BW Art Scale scores. The corres- 
ponding correlation for women, however, was only -.08. The 
Investigators interpreted this result as Indicating that men 
who are more proficient In coping with the complex figures 
of the Embedded Figures Test prefer more complexity In art 
productions. 

Caraeena and King (1962) , In an exploration of the uni- 
tary nature of complexity as a personality dimension, 
obtained a score for each of 30 male and 30 female undergra- 
duates on the BW Art Scale, seven scores from the Blerl- 
Blacker scoring modification of the Rorschach, and a score 
on Kelly *s Repertory Test. The corr^^latlons of the other 
eight measures of complexity with BW Art Scale scores ranged 
from .12 to -.12, all of which were insignificant. Apparently, 
complexity is not a unitary dimension of personality. 

The Teaching of Creativity 

Until now we have dealt exclusively with correlational 
studies concerning the validity of the BW Art Scale. Recently, 
however. Brown (1964, 1965# 1966) has reported a series of 
experiments in which changes in BW Art Scale scores were used 
as the dependent, or criterion, variable. In the first of 
these experiments (Brown, 1964), the teacher in a semester 

2 

Recently, Professor Wayne Holtzman has communicated 
with the writer concerning* the results of a study on the 
ability of the WFPT to discriminate among a select group of 
artists, 8u:chitects, and engineers. Using a sample of 28 
artists, 31 architects, and ?6 engineers, significant differ- 
ences were obtained among the mean scores of the three groups 
on six of the a priori content keys and the Movement Scale 
of the WFPT. Of the three groups, the mean of the architects 
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course for elementary education juniors attempted to develop 
a creativity ”self” or “frame of reference”, built around a 
series of “creativity symbols”. In the students. The pur- 
pose was to train the students so that the use of one of the 
symbols would “trigger” the creativity frame of reference. 

This teaching procedure resulted In a mean gain of eight to 
nine points on the BW Art Scale. 

In a second study (Brown, 1965) a similar procedure 
was used to develop “creative sub-selves” which could be 
triggered by different symbols. The mean gain In BW Art 
Scale scores from Initial testing In February to retesting 
In May was 18 points for the group when the creativity sub- 
selves were triggered at retesting time. The mean gain was 
significantly higher (p<.001) than gains obtained iinder 
conventional conditions. 

Brown (1966) also reports on a project designed to 
produce changes In teachers’ feelings about children and 
teaching, their colleaigues, suid their own potential for 
growth. Significant changes (p<.005) In BW Art Scale scores 
for 29 teachers In four schools were associated with this 
project during Its first year. 

Criticism of the BW Art Scale 

It Is probably unnecessary to be too critical of studies 
that have employed the BW Art Scale, because many of the 
shortcomings of research with this scale are generally true 
of research on psychological tests. Prom the preceding 
review, however. It may be noted that the correlations of the 
BW Scale with other tests, although frequently significant, 
are not large, and the results rre infrequently cross-valida- 
ted. In addition, studies that have found differences between 
the means of various groups on the BW Art Scale are usually 
based on rather small samples. A fair conclusion after 
reviewing this work Is that the results are Interesting and 
provocative, but It Is not clear at this point exactly what 
aspect or correlate of “creativity” Is being measured by 
the BW Art Scale, 

Another criticism of the BW Art Scale was made by 
Moyles, Tuddenham, and Block (1965)« These writers report 



was highest and the artists lowest on the Freehand Line- 
Simple, Freehand Line-Complex, and Black scales, while the 
mean of the architects warS highest and the engineers lowest 
on the Ruled Line-Simple, Ruled Line-Complex, and Shading 
scales. Finally, the mean score on the Movement Scale was 
highest for the engineers and lowest for the artists. 

3 

•^See McWhlnnle (1965) for a review of reseeirch on 
aesthetic measurement and perceptual choice and an Indication 
of some of the contradictions in Investigations on this subject. 
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that the stimulus dimension of simplicity-complexity, which 
Is what Barron and others have frequently stated to be the 
dimension being measured by the BW Art Scale, is highly con- 
founded with the dimension of symmetry-asymmetry. A separa- 
tion of these two dimensions showed that they were, indepen- 
dently, of small Importance in determining figure preference. 
As a consequence of their findings, the writers Issued a word 
of caution against the tendency to attribute figure prefer- 
ence to the single dimension of simplicity-complexity and, 
by extension, to the simplicity and complexity of the person- 
alities of examinees. 



The Hevised Art Scale 

Welsh (1959) points to a difficulty with the BW Art 
Scale, stemming from the fact that the scoring standards on 
the 65-ltem scale are such that higher scores may be obtained 
when the examinee places more items in the Don*t Like (DL) 
category than in the Like (L) category. For example, a sub- 
ject can obtain a score of 40, a rather high score, simply 
by indicating that he doesn't like any of the items. Conse- 
quently, high correlations are obtained between DL score and 
total score on the BW Art Scale. In order to eliminate this 
difficulty, a Revised Art Scale (RA Scale) was constructed. 

The RA Scale consists of 60 items, 30 of which high scorers, 
as opposed to low scorers, on the BW Art Scale indicated that 
they like and 30 of which they indicated that they don't like. 
In constructing the scale, high and low groups of scorers 
who also had average DL scores were used. Thus, a scale 
was produced that had a high correlation with BW Art Scale 
scores but zero correlation with DL total scores. 

Correlations of the RA Scale With Other Tests 

It will be recalled from findings reported in an earlier 
section (Barron, 1957) that the BW Art Scale suid Terman's 
Concept Mastery Test apparently have substantial overlap. 

The question arises, therefore, whether the RA Scale and 
tests of intelligence also have a great deal in common. If 
the answer is in the affirmative, then the effectiveness of 
the RA Scale as an independent measure of creativity would 
be somewhat reduced. A partial answer to the question was 
obtained by Welsh in a study of 368 gifted high school stu- 
dents in a residential summer school. For this group, the 
correlation between two measures of intelligence , one verbal 
(Terman Concept Mastery Test) and the other non-verbal (D-48), 
was significant. But the correlation of both tests with the 
RA Scale was essentially zero. It was also found that stu- 
dents in the Academic Division of the program scored higher 
In tested intelligence , but students in the Art Division 
made higher scores on the RA Scale. It should be noted, 
however, that these statistics were obtained on a highly 
select group and that the correlation between intelligence 
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and RA Scale scores, for example, might prove to be somewhat 
higher with groups showing a greater range of scores (see 
Welsh, 1966), 

Mlllman and Chang (1966) correlated scores on the RA 
Scale with the scores of 32 undergraduates on the Graves 
Design Judgment Test and the Meler-Seashore Art Judgment 
Test. The rank-order correlation between the RA Scale and 
the Graves was ,46, whereas the correlation between the RA 
Scale and the Meier was -,l6, and between the Meier and 
Graves It was .19, The writers attribute the higher correla- 
tion between the RA Scale and the Graves as being due to the 
fact that both of these tests use abstract designs. 

In a comparison of three creativity measures— the RA 
Scale, Mednlck's Remote Associates Test, and a Slmll8U?ltles 
Test— Colman (1966) found a correlation of .52 (p<.10) between 
RA Scale scores and Remote Associates scores In a group of 
13 students with high RA Scale scores. On the other h&uid, 
for a group of 11 students with low RA Scale scores, there 
was a correlation of ,53 (p<.10) between RA Scale scores and 
mean nxjmber of words used per similarity on the Similarities 
Test. 

Relationship of RA Scale to Creativity Motivation 

Golann (1962a, 1962b) administered a "creativity motiva- 
tion" questionnaire, which measured preference for creative 
activities, 8Uid the RA Scale to sixth and eighth grade chil- 
dren. Then the correlations between these two variables were 
computed for different age and sex groups. The obtained 
correlations varied from -,l4 to +.37, four of them being 
significant at the ,05 level or better. Perhaps these results 
could have been Interpreted more meaningfully if Golann had 
also obtained measures of Intelligence and/or school grades 
on his subjects. In any event, he also reported that males 
who made high RA scores preferred self -expressive. Independent 
activities which allow for use of creative capacity, while 
males who made low RA scores preferred activities which were 
structured, assigned, familiar, or routine. 

RA Scores and Artists 

Raychaudhurl has Issued a number of reports (Raychaud- 
**url, 1961, 1962, 1963 1 1966a, 1966b, 1966c) on the RA scores 
of musical and painting artists, as compared to those of 
controls. In one Investigation (Raychaudhurl, 1966a), the 
mean RA Scale score of 30 musical artists of India was 29.00, 
and the mean of 50 non-artists was 19.OO, a significant 
difference (p<.01). The mean RA score of 60 painting artists, 
however, was 40.83, which is significantly higher (p<.01) 
than that of the musical artists. These obtained differences 
between artists and non-artists on the RA Scale are reinforced 
by other studies (Raychaudhurl, 196I, 1962, I963, 1966c). 
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Raychaudhurl Interprets the difference between the HA scores 
of the painters and musicians with the hypothesis that 
<'••• those who succeed In painting arts have greater ambiguity 
tolerance In visual field than that of those who succeed In 
arts employing a different sense-media, e.g. auditory In case 
of music »» ( sic . Raychaudhurl , 1966a, p. 3). 

In a more extensive study of the personality factors 
differentiating 30 professional musicians with high scores on 
the HA Scale from 30 non-creatlve subjects with low scores on 
the HA Scale, Haychaudhurl (1966b) had two clinicians analyze 
the differences between the responses of these two groups to 
the Horschach, the TAT, the Szondl Test, and a life-history 
Interview. The results Indicated that ”The creative musician 
Is more distinctly marked by his emotional and temperamental 
characteristics than by cognitive and motivational aspects 
of his personality structure” (Haychaudhurl, 1966b, p, 210). 
Thus, the musicians appeared to be more emotional, egocentric, 
exhlbltlonlstlc , and sensitive Individuals who were stimulated 
by frustration and preferred activities and situations that 
permit a greater degree of Individualism and self-expression. 

It was also concluded that familial and soclo-cultural field 
factors were more Important than heredity In determining 
musicianship. 

Personality Correlates of HA Scale Scores 

Cashdan' and Welsh (19^6) classified 311 adolescents 
attending the Governor *s School of North Carolina Into high 
(HA^46) euid low (HA^l6) creativity on the basis of their HA 
Scale scores and sex (Male/Female) and then analyzed the 
differences among these four groups In terms of their scores 
on the 24 scales of Gough *s Adjective Check List (ACL). The 
results were that the two high HA groups made significantly 
higher scores on the lability, heterosexuality, change, auto- 
nomy, stnd counseling readiness scales of the ACL than the 
two low HA groups, while the two low HA groups scored signi- 
ficantly higher on the deference and order scales of the ACL. 
Hegardlng differences between the sexes, males scoring high 
on the HA Scale also scored significantly higher than high 
HA females on the self-confidence, dominance, exhibition, and 
autonomy scales of the ACL, while high HA females scored 
higher on the ACL nurturance, heterosexuality, succorance, 
abasement, deference, and counseling readiness scales. However, 
there were also significant HA by Sex Interactions on the 
achievement, dominance, endurance, order, succorance , and 
abasement scales of the ACL. 

In a second part of the Cashdan and Welsh (1966) study, 
all subjects who scored 40 or higher on the HA Scale were 
classified according to specialty (art versus natural science) 
and sex. The results revealed no significant differences In 
ACL scores between the two specialty or two sex groups. 
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Significant specialty by sex interactions occurred, however, 
on the favorablllty and defensiveness scales of the ACL. In 
a summary of these results, the writers characterized the 
highly creative adolescent as an Independent, non-conforming, 
change-seeking individual who has open and active interper- 
sonal relationships. In contrast, the adolescent of low 
creativity, and the male in particular, is described as a 
somewhat compulsive, highly achievement-motivated individual 
who seems to be experiencing external pressures to succeed. 

It is also concluded that creative adolescents of both sexes 
are quite similar in their personality characteristics. 

Use of the RA Scale in Other Educational Settings 

In a study of creative teachers and pupils, James (1963) 
obtained a mean RA score of 39*33 for the former groups, which 
is quite close to that obtained with artists. James also 
observed a greater tolerance of distorted and diverse figures 
on the part of these creative teachers. 

A before-and-after study by Falcone (I962) used the AC 
Test of Creative Ability and the Welsh Figure Preference Test 
as both pretests and posttests. The purpose of the investiga- 
tion was to evaluate changes in scores on these tests associ- 
ated with a unit in a high school economics course designed 
to encourage students to develop their creative abilities. 
Materials used in the unit were a series of colored slides 
employed by the teachers in their regular classes. The find- 
ings were that the posttest mean (49.92) on the RA Scale of 
the WFPT was not slgnlf Icsuitly different from the pretest 
mean (50.00). In addition, there was a very low relationship 
between Increases in T-scores on the RA Scale for the 77 
students 8Uid whether or not they had prior experience in art 
or interior design. Finally, it should also be noted that 
the posttest mean (50.20) on the Conformance Scale of the 
WFPT was not slgnlf Icantly different from the pretest mean 

(50.01). 

RA Scale Scores of Various Age Groups 

Table 2 gives means and standard deviations of scores 
on the RA Scale as a function of age or grade level. Observe 
that, for both boys and girls, the means tend to Increase 
slightly from the third grade through the seventh grade suid 
then decline slightly through the tenth grade. Although the 
relatively high positive correlation (/o = .62 for Golann*s 
data) between means and standard deviations for boys should 
be taken into account in the statistical evaluation of the 
significance of the differences among means, there is at 
least a suggestion of a curvilinear trend of mean RA score 
across age group. If this trend were supported by other 
research it would certainly provide an opportunity for spec- 
ulation. For example, if RA Scale scores are a measure of 
creative potential, can it be concluded that creative poten- 
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Table 2 ^ 

Means and Standard Deviations for Various Age Groups on 
the Revised Art Scale of the Welsh Figure Preference Test 



Sample 


Number 

of 

Subjects 


Mean 


Standard 

Deviation 


Pour-to-slx year olds^^ 


20 


26.30 


5.^3 


Slx-to-elght year olds^ 


82 


22.59 


9.10 


School Children® 


Third grade boys 


21 


21.67 


‘*.73 


Third grade girls 


32 


21.81 


12.62 


Fourth grade boys 


62 


2i^.32 


9.93 


Fourth grade girls 


79 


23.29 


10.02 


Fifth grade boys 


4l 


23. 02 


11.89 


Fifth grade girls 


45 


24.09 


12.30 


Sixth grade boys 


88 


24.11 


12.03 


Sixth grade girls 


99 


24.38 


12.55 


Seventh grade boys 


52 


29.44 


12.66 


Seventh grade girls 


68 


28.13 


12.60 


Eighth grade boys 
Eighth grade girls 


,ll 


25.66 

26.83 


13.60 

12.78 


Ninth grade boys 


111 


23.20 


12.82 


Ninth grade girls 


96 


23.30 


12.76 


Tenth grade boys 


97 


22. 26 


11.47 


Tenth grade girls 


86 


21.52 


13.82 


Adults'^ 


Men 


75 


17.33 


11.46 


Women 


75 


19.33 


12.20 



^After Smith, 1962. 
^After Welsh, 1959. 



® After Golann, 1961. 
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tlal reaches a maxlmiun In junior high school and then declines? 
If this Is true, what aspects of maturation or educational 
structure are responsible for the rise and fall of creative 
potential? Whatever the answers to these questions, the rela- 
tionship of age to preference for designs on the WPPT should 
be a topic worthy of Investigation, And It has been Investi- 
gated, as will be seen in the next section, 

WPPT Scores as a Function of Perceptual and Mentstl Maturity 

The Children Scale 

It was stated earlier In this review that the WPPT could 
be administered to children as young as six years of age. 

But since the task on the WPPT Is relatively simple. It would 
seem that the test might even be administered to preschool 
children. And as Welsh (1959) Indicates, the WPPT has been 
given to children as young as 3i years. Smith (I962) notes, 
however, that children younger than years are usually 
unable to make the aesthetic discriminations required by the 
test. The preferences of preschool children are usually less 
reliable and distinctive than those of older children, because 
fritter respond more Immediately and affectively. Smith* s 
(1962) study of 20 preschool children found that the Don*t 
Like (DL) and RA Scale mean scores were actually higher than 
those of adults. But the Investigator hypothesized that the 
distinctiveness of children *s preferences for either simple, 
symmetrical figures or complex, asymmetrical ones Increases 
with maturity. 

The Preliminary Manual of the WPPT (Welsh, 1959) refers 
to a Children Scale of 50 Items obtained by comparing the 
responses of 82 boys and girls, aged six to eight, with those 
of 150 people-ln-general, Welsh notes, however, that the 
scale Is only In preliminary form and has not been cross- 
valldated, 

A Perceptual Maturity Scale 

Van de Castle (19^2) attempted to confirm Hesterly and 
Berg*s (1958) finding that the perceptual choices of children 
and adult schizophrenics are similar but that both groups 
differ from normal adults In their choices. The procedure 
was first to construct a "Perceptual Maturity Scale" consis- 
ting of 94 Items from a 144-1 tern short form of the WPPT which 
significantly dlffei'entlated between 200 elementary school 
pupils (second and third grades)' and 200 college students 
having low Schizophrenic scores on the MMPI, Next, this 
"Perceptual Maturity Scale " (PMS) was administered to new 
groups of elementary s.)hool children, college students, and 
adult schizophrenics. The mean score on the PMS was slgnlfl- 
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cantly h.1.gher for 11-12 year olds than for 7-8 year olds and 
higher for the college students than for the children. In 
addition, the mean score of the college students was signifi- 
cantly higher than that of the schizophrenics. But unlike one 
of the findings of Hesterly and Berg (1958) with the Perceptual 
Reaction Test (PRT), the mean score of the children on the 
PMS was also significantly lower than the mean of the schizo- 
phrenics. Van de Castle (1982) confirmed his results with 
a revised form of the PMS designed to eliminate the effects 
of "Don't Like" response bias that existed In the earlier 
form of the scale. He also concluded that Berg's PRT measures 
response bias rather than perceptual maturity. 

Further work by Van de Castle (I965) and Edwards (I98I) 
was oriented towaord the development of a more comprehensive 
scale of perceptual maturity that would distinguish among 
different age groups. Edwards' (198I) procedure was to admin- 
ister 150 Items from the WFPT to 200 children aged eight years 
or less and to 200 normal adults In the age range 21-49 years. 
The 150 Items were presented In a forced-choice foimat, each 
Item-pair consisting of a probable adult choice and a child 
choice, based on previous work. The 92 Item-pairs which 
significantly differentiated between the children and the 
adults were then assembled as a Perceptual Maturity Scale, 
and this scale was cross -validated on 100 eight to twelve 
year old children, 100 normal adults and 100 schizophrenic 
adults. It had been hypothesized that schizophrenic adults 
would make significantly lower scores than normal adults on 
such a scale, but this hypothesis was not confirmed. There- 
fore, the 72 out of 92 Items which significantly differentiated 
between the children and the normal adults were reassembled 
as a new scale, and each Item was given a one-point weight. 

The means of the total scores on this new scale showed a 
progressive Increase with age, and they were significantly 
different for normal adults and children. Consistent with 
the results of the Initial study, viz. the finding of an 
Insignificant difference between the mean scores of normal 
and schizophrenic adults, however, was the finding of a sig- 
nificant difference between the mesui scores of children and 
schizophrenic adults. In addition, maturity ratings of the 
children by their teachers were positively related, to a 
slight degree, to the PMS scores of the former, but there 
was no relationship between PMS scores and scores on tests 
of Intelligence or reading ability. 

In a further, more carefully controlled effort along 
these same lines. Van de Castlv'5 (1965) developed a 72-ltem, 
forced-choice form of Perceptual Maturity Scale, Scores on 
this form of the PMS had a reliability of .90 but zero 
correlation with IQ. Not only did mean scores on the scale 
show a steady Increase from age seven through adulthood, but 
students rated as behavlorally more Immature by their teachers 
had significantly lower mean PMS scores than students rated 
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as behavlorally more mature. Some other findings were: 

(a) The mean FMS score of a group of delinquent males 
was significantly lower than that of a group of 
non-delinquent males of the same age. 

(b) The mean PMS score of a group of adult schizophrenics 
was significantly lower than that of a group of 
normal adults. 

(c) A group of students who were low on a "maturity 
scale” consisting of Items from the California 
Psychological Inventory had a lower mean PMS score 
than a group of students who made high scores on 
the "maturity scale”. 

(d) For 42 college students, the correlation between 
PMS scores and scores on the Welsh Anxiety Scede of 
the MMPI was -.40. 

Table 3 shows some of the results obtained In a later 
study with various age groups of Americans and Cuna Indians. 

It will be noted that, for both Americans and Cuna Indians, 
mean PMS score Is an Increasing function of age, although 
the mean scores of the Indians are lower than those of the 
Americans at every age. 

These studies demonstrate that the PMS has a substantial 
degree of construct validity, although all of the Investiga- 
tions which have been reviewed are of the cross-sectional 
or correlational type. Van de Castle (1965) also notes that 
significant differences on the PMS appear only when the number 
of subjects Is large and that differences are obtained between 
the scores of males and females at the younger age levels. 

Mental Maturity and the WFPT 

In an extensive, well-conducted study, Watson (1964)^ 
took a slightly different approach to the relationship between 
maturity and responses on the WFPT. The subjects were 66 boys 
and 66 girls In each of three school class groups : gifted 
(1^125 on the WISC), regular classes (90^IQ^110 on the Cali- 
fornia Test of Mental Maturity), and mentally retarded (IQ^75) 
on the Stanford-Blnet) . The WFPT was administered to these 
396 children, and chi-square comparisons were made between 
the responses of the three groups to the 400 Items on the test, 
as well as their scores on 32 scales of the WFPT. As a result 
of the Item analysis, two cross -validated scedes were con- 

Watson (1964) also gives a good summary of the litera- 
ture on the WFPT to 1964. 



Table 3 



Average Scores on the Perceptual Maturity Scale (PMS) 
of Americans and Cuna Indians at Various Age Levels 







Americans 






Cuna Indians 


Age 

Level 














No. of 


No. of 


Average 


No. ol 


No. of 


Average 


Males 


Females 


PMS 


Males 


Females 


PMS 


10 


85 

85 


85 


45.11 


11 


2 


38.45 


11 


85 




27 


27 


12 


17 


17 


48.38 


38 


21 


45.62 


13 


17 


17 




51 


8 


14 

15 


19 


19 


54.13 


53 


33 


50.38 


19 


19 




58 


16 


16 

17 


19 


19 


55.78 


26 


10 


53.67 


19 


19 




25 


6 


18+ 

(Ad\Jlt) 


90 


90 


58.80 


12 


7 


54.70 


Total 


370 


370 


50.76 


301 


130 


47.93 



Note.- Personal communication from Dr. Robert Van de Castle. 
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structed: ( 1 ) a scale of 103 Items which discriminated 
mentally retarded children of low estimated mental age from 
the other groups, and (II) a scale of 50 items which discrim- 
inated gifted students of high estimated mental age from 
others. An analysis of the mean scores of these groups on 
32 scales of the WPPT indicated that the mentally retarded 
made significantly different scores from others on the Con- 
formance, Revised Art, Neuropsychiatric, Children, Male- 
Female, Movement, Shading, Dotted, Sex Symbol, and Mixed 
scales. In the main, the scores of the mentally retarded 
subjects suggested less conforming, more childlike responses, 
more consistent with the responses of the mentally ill than 
with those of normals. They disliked shaded figures, figures 
suggesting movement, and figures generally preferred by females. 
In contrast, the gifted subjects scored high on the Confor- 
mance and Revised Art scales and low on the Children and 
Neuropsychiatric scales. The answers of the gifted were more 
consistent with those of females in general, but they disliked 
the dotted and sex symbols, neutral figures. Another inter- 
esting finding was that, in general, the preferences of 
children in the regular classes were more consistent with 
those of the gifted than with those of the retardates. Fin- 
ally, there was, as expected, a significant difference between 
the means of boys and girls on the Male-Female Scale, a differ- 
ence which increased with age. 

In sum, Watson (1964) demonstrated that the thrermiental 
ability groups responded differently to the items of the WFPT, 
Therefore, it might be reasonable to suppose that certain 
items on the WFPT could be used as a non-verbal intelligence 
scale. Again, however, the fact that the WPPT can discrimi- 
nate between groups of different mental abilities should not 
be interpreted as meaning that it can differentiate between 
two specific individuals having different abilities. And 
Watson's (1964) conclusion that the WPPT csui now be used in 
special education classes appears, for this reason, to be 
questlonnable. 



Don't Like, Confoimance, and Male-Female Scales 
Don't Like Scale 



An examinee's Don't Like (DL) score on the WPPT is ob- 
tained simply by counting the number of items, out of 400, 
that he places in the "Don't Like" category. The Preliminary 
the WPPT (Welsh, 1959) reviews a series of unpub- 
j.xsnea investigations in which scores on the DL Scale were 
correlated with scor s on Gough's Adjective Check List, the 
California Psychological Inventory, the Edwards Personal 
Preference Schedule, suid the MMPI, The msuiual s.lso summarizes 
the iv^sults of the study by Schultz and Knapp (1959) referred 
to earlier in this review. The reader will recall that this 
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study found that subjects who liked the symmetrical and dis- 
liked the asymmetrical designs on the BW Art Scale described 

subjects who liked the 

asymmetrical and disliked the symmetrical designs. 

^ a more recent study, concerned with the diagnosis 

Boelllng, Hutton, and Mathews (1962) 
administered the WPPT and three other tests to 13 pairs of 

patients. ! 5 ach pair of subjects 
consisted of a brain-damaged patient and a schizophrenic 
patient matched on sex, race, education, and length of hospl- 
1 Ak DL score for male schizophrenics was 

brain-damaged men, and 

the me^ DL score for female schizophrenics was 111 . 00 , as 

b^tSeSn ^aln-damaged women. The differences 

between mean DL scores of the brain-damaged and schizophrenic 

groups were statistically insignificant, as were the dlffer- 

brain-damaged and schizophrenic 
patients on 26 other scales derived from the main secret 

Although the mean DL scores of both patient groups were quite 
Q CO \ than the means for normals reported by Welsh 

did not appear to vary with age or length 
of hospitalization. L»Abate et al. (1962) concluded that DL 

scale scores have questlonnable usefulness as indices of 
Drain damage. 

Confor mance and Independence of Judgment 

The Conformance (CF) Scale, also referred to as “Confor- 
mity Scale” or “Agreement with Consensus Scale", of the WPPT 
consists of 38 items, 13 of which were liked and 25 disliked 
significantly more often by 50 artists than by a group of. 

^®lsh (1959) reports a number of 
correlations between CP scores and scores on the 
Adjective Check List, the California Psychological Inventory 
and the MMPI. ^ 

Llm (Unpublished paper, no date) views the CP Scale as 
a measure of consensus in perception, similar to the Horschach 
popiuar response, which presumably measures the congruence 
tween the examinee’s perception and that of normal people. 
However, this measure of “perceptual consensus" did not 

^he Chinese and non-Chinese In a group 
of 140 children in a San Prauicisco school. Of course, these 
results may be Interpreted as indicating either that the 
Chinese and non-Chinese children perceive the world in similar 
ways or that the CP Scale Is not sensitive to differences in 
perception between the two groups. 

Three of the variables used by Wrlghtsman and Cook (1965) 

analysis of 73 personality, attitude, and aptitude 
variables were Number of “Like" Responses, Agreement with 
Consensus, and Independence of Judgment on the WPPT. The 
subjects were 177 female college students in Nashville, Tenn- 
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essee. Eleven factors were extracted, and the following 
factor loadings were obtained for the three WPPT variables. 
Agreement with Consensus had a loading of ,352? and Indepen- 
dence of Judgment a loading of -,5865 on Factor I: Rigidity; 
Independence of Judgment had a loading of -,252? on Factor 
III : Anti-Negro Attitudes ; Agreement with Consensus had a 
loading of ,3^1? and Number of ”Llke” Responses a loading of 
-,4l83 on Factor VI: Sociability, 

Masculinity-Femininity and the WFPT 

Welsh (1959) describes a Male-Female (MF) Scale on the 
WFPT which was constructed by contrasting the responses of 
75 men with those of 75 women. Although the original 46-ltem 
MF Scale did not hold up under cross-validation. Items from 
the WFPT were used subsequently In a number of Investigations 
of the mascidlnlty- femininity dimension. For example, Lessler 
(1964) used figures from the WFPT, along with other figures. 

In a study of the referents used In sorting line drawings 
Into male suid female categories. The results showed that 
some of the drawings (symbols) were sorted according to a 
cultural referent, others according to a genital referent, 
and still others according to both cultural and genital refer- 
ents simultaneously. In general, however, the subjects res- 
ponded to the symbols on the highest possible psychic level, 
l,e. In terms of cultural, rather than genital, referents 
when both were present simultaneously. 

Lacrosse (1965) used stimuli from the MF Scale of the 
WFPT, along with others. In a study designed to determine 
the relationship of biological sex, selected psychologic ad 
sex variables, intelligence, and perceptual rigidity to visual 
exploratory behavior In a sample of 20 girls and 20 boys In 
the second and sixth grades. The MF Scale designs were those 
preferred significantly more often by girls or boys and were 
selected to determine whether the complexity or sexual pull 
of a design would be the more Important variable In deter- 
mining total time spent looking at It, No relationship was 
found between intelligence , grade, or perceptual rigidity 
and visual exploratory behavior, viz, total time spent look- 
ing at the stimuli. In addition, there was no relationship 
between biological sex and total looking time, and the rela- 
tionship between psychologlcsd measures of masculinity-femini- 
nity and time spent looking at the stimuli was complex. 

The findings of this investigation underscored the multi- 
dimensionality of the concept of masculinity-femininity and 
suggested further work designed to isolate its components, 

Littlejohn (1966) investigated the responses of 324 
ninth-grade boys and 332 ninth-grade girls to conventional 
and subtle measures of masculinity-femininity. The conven- 
tional measures wc:i ’3 the MMPI Mf Scale, the California 
Psychological Inventory (CPI) Fe Scale, and Nichols' "Obvious" 
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Scale. The subtle scales were the Female-Male (FM) Scale 
of the WFPT— a 76-ltem, cross-validated version of the earlier 
Male-Female Scale— and Nichols* “Subtle" Scale. High scores 
oi>i all of these scales sure In the more feminine direction. 

The major hypothesis was that highly creative males and females 
—high creativity being defined as a score falling In the top 
third of the distribution of Revised Art Scale (RA) scores 
for the group of 656 subjects— would show a reversal of sex 
Identification which would be manifested by their scores on 
the masculinity-femininity measures. A comparison group of 
low creative students consisted of those scoring In the bottom 
third of the distribution of RA scores. Concerning scores on 
the FM Scale, the results verified the hypothesis In the case 
of males: The 113 high-creative boys scored significantly 
higher (mean FM = 25.76). However, the difference between 
the means of the two male groups was not significant on any 
of the other masculinity-femininity measures. In the case 
of the girls, the hypothesis was not confirmed. In contrast 
to what was predicted, the 98 high-creative girls also scored 
significantly higher than the 107 low-creative girls on the 
FM Scale. The mean FM score for the former group was 57.63* 
but for the latter group It was only 36.64. Interestingly 
enough, however, the direction of t^e differences between 
the means of the two female groups on three of the other four 
measures of masculinity-femininity was Just the reverse of 
that on the FM Scale, viz. In the direction hypothesized. 

Thus, low-creative girls scored significantly higher than 
high-creative girls on the "Subtle ’*, ''Obvious ", and MMPI Mf 
scales. 

Some related findings of Littlejohn's (I966) study are 
pertinent and of Interest. Further comparisons between the 
means of high-creative and low-creative boys and girls on 
11 other WFPT scales revealed that the means of the hlgh-cre- 
atlves was significantly higher than that of the low-creatives 
on the Barron-Welsh Art Scale, Children Scale, Movement Scale, 
Figure -Gromd Scale , Sex Symbols-Male Scale, Sex Symbols -Female 
Scale, and Sex Symbols -Neutral Scale. On the other hand, the 
mean of the high creatives was significantly lower than that 
of the low-creatlves on the Conformance Scale, Neuropsychla- 
trlc Scsde, Sex Symbols -Combined Scale, and Sex Symbols-Mlxed 
Scale.. The directions of the differences between the means 
of hlgh-creatlves and low-creatlves were the same for males 
and females. 

The consistent differences between the mesuis of high- 
creative and low-creative groups on various WFPT scales would 
Indicate that the Intercorrelations among the WFPT scales 
must be substantial. In fact, Littlejohn reports a correla- 
tion of .88 for the 324 males and ,81 for the 332 females 
between RA and FM. Also, Harris (I96I) gives Intercorrelation 
matrices for 16 WFPT variables, and inspection of these 
matrices shows that the high degree of correlation among many 
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of the WPPT scales cannot toe explained solely toy overlap 
among Items, toecause scales other than the BW Art Scale 
and HA Scale have few Items In common. 



Relationships of the WPPT to Personality Inventories 

In this section we shall consider a numtoer of correlates 
of responses to Items and additional scales on the WPPT. 

Twice during this review we have referred to the Schultz and 
Knapp (1959) study. In which It was found that the adjectival 
self-descriptions, given toy sutojects who preferred simple 
designs were different from those given toy sutojects who pre- 
ferred complex designs. In a related Investigation, Hosentoerg 
and Zlmet (1957) correlated scores on the California Putollc 
Opinion Scale (P Scale) with various scores on the WPPT for 
three small groups of students. It was found, as hypothesized, 
that high scorers on the P Scale, Interpreted as toeing more 
“authoritarian” In outlook, preferred balanced, two-dimen- 
sional, commonplace designs. In contrast, low scorers on the 
P Scale, Interpreted as toeing less “authoritarian” In outlook, 
preferred the more complex or deviant designs on the WPPT. 

WPPT and the 

It will toe recalled that Welshes (1959) expressed purpose 
In developing the WPPT was to construct a kind of non-vertoal 
form of the MMPI which would toe relatively free from faking 
and useful In personality assessment. Therefore, Investiga- 
tions which have related WPPT scores to MMPI scores are of 
particular Interest. The major question here Is: To what 
extent do WPPT scores measure the same variables that are 
measured toy the MMPI? 

Pepper (1957) Investigated the relationship of the A 
(“anxiety”) and R (“repression”) scales of the MMPI to WPPT 
scales. Pour groups of college students were tested: 9 
students scoring high on tooth the MMPI A and R scales (Hl-Hl), 
10 students scoring high on A tout low on R (Hl-Lo) , 20 students 
scoring low on A toht high on £ (Lo-Hl), and 6 students scoring 
low on tooth A and R (Lo-Lo). The results showed significant 
differences between the Lo-Hl group and the Hl-Lo group on 
the empirically derived Neuropsychiatric and £W Art scales 
and on the a priori content Ruled Line-Simple (angular, bila- 
terally symmetrical). Ruled Line-Total, Preehand Line-Complex, 
Simple-Total, and Shading and Cross-Hatched Line scales. 

The means of the Lo-Hl group were significantly lower than 
those of the Hl-Hl group on the Ruled Line-Simple, Ruled Line- 
Total, Simple-Total, and Black scales of the WPPT. In addi- 
tion, there were significant differences between the Hl-Lo 
and Hl-Hl groups on the Neuropsychiatric, BW Art, Ruled Line- 
Simple, Preehand Line-Complex, Shading and Cross-Hatched Line, 
and Movement scales of the WPPT. Pepper (1957) concluded 
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that the figure preferences of the Lo~Hl group, who preferred 
simple figures and rejected complex ones, are another manifes- 
tation of the constriction and control Indicated by their low 
"Anxiety”, high "Repression" r res on the MMPI. In contrast, 
the preferences of the Hl-Lo gx< up were more like those of 
artists. Finally, the Hl-Hl group showed a dislike for free- 
hand, messy, and complex figures, which Pepper also sees as 
consistent with their scores on the MMPI A and R scales. 

Another study employing both the WPPT and the MMPI was 
conducted by Wahba (1962), Two-hundred undergraduate males 
were classified Into four groups of 50 students each according 
to their scores on the A and R scales of the MMPI. Group I 
was high on A and low on R, Group II was high on A and high 
on R, Group III was low on A and low on R, and Group IV was 
low on A and high on R, A comparison of their responses to 
the 400 Items on the WPPT Indicated that Groups I and II 
combined (high A) preferred shaded and freehand, complex de- 
signs and rejected ruled, simple designs. These findings are 
consistent with those of Pepper (1957). On the other hand. 
Groups II and IV combined (high R) tended to dislike Items 
having male, combined or mixed sex symbols. 

In another analysis of his data, Wahba (1962) performed 
one-way and two-way multivariate analyses of varlauice on the 
total number of WPPT Items, confirming the significance of 
the discriminant set of scales and supporting the hypothesis 
that the A cuid R scales of the MMPI are orthogonal In their 
effects, A further Investigation with a more homogeneous 
sample of 95 males corroborated the findings of significant 
relationships between the WPPT scales and the MMPI R Scale 
obtained in the first study. The majority of the correlations 
of WPPT and the MMPI A dimension, however, did not hold up 
under cross-validation. 

Llm and Ullman (1961) used chi-square tetts to auialyze 
the relationships among the scores of 125 male neuropsychla- 
trlc patients on four WPPT scales and 12 MMPI scales, with 
the following results. The relationships of scores on the 
Don’t Like (DL), Conformance (CP), Revised Art (RA), and 
Neuropsychiatric (NP) scales of the WPPT to age, education, 
and IQ were all negligible. However, the chi-squares rela- 
ting CP to DL (V® = 13.97K CP to RA (X® = -24.18), CP to 
NP (X® = 8.19), and RA to NP (X® = -52.51) were highly 
significant. The writers Indicated that the major result 
was the finding of a significant association between RA and 
NP (r = -,792, N = 125). This high association Is Interpreted 
as reflecting the fact that "In mental Illness there Is a 
prlmltlvlzatlon of the perceptual processes which seems to 
be tapped by the WPPT" (Llm & Ullman, 1961, p. 6), 

Also of Interest In the Llm and Ullman (1961) study 
were the relationships of the MMPI clinical scales to the 
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WPPT. First, the MMPI Pa Scale and WPPT CP Scale were 
negatively associated == -4,84, N = 124), In addition, 
the MMPI P Scale was significantly related to the RA = 3.56) 
and NP (X® = -24.78) scales of the WPPT. Finally, the WPPT 
RA Scale was moderately related to the MMPI P = 3.56), 

D (X® = 2.65), and Mf (X® = 2.60) scales, but the WPPT DL Scale 
was not sl^,nlflcantly related to any of the MMPI scales. The 
writers point to the consistency of the finding of a signifi- 
cant positive association between CP and NP with the finding 
of a negative association between NP and the MMPI F Scale. 

WPPT Scores and the California Psychological Inventory 

Harris (1962) computed the correlations of the scores 
of 91 ninth-grade girls. 111 ninth-grade boys, 86 tenth-grade 
girls, and 97 tenth-grade boys on 17 scales of the WPPT with 
their scores on the 18 scales of the California Psychological 
Inventory (CPI), four Trait Rating Scales and Otis IQs, In 
general, the correlations of the WPPT scales with teachers* 
ratings of the students on the traits of intellectual effi- 
ciency, conformity, emotional stability, and social relation- 
ships were smaller than the correlations of the CPI scales 
with the trait ratings. In addition, the correlations between 
the 17 WPPT scales and the 18 CPI scales were rather small. 

There was a small, but significant, negative correlation 
between the WPPT RA Scale and the CPI Achievement via Confor- 
mance Scale, but RA Scale scores were not significantly rela- 
ted to IQ, These findings led Harris to question the use of 
the WPPT by high school guidance personnel in the counseling 
of adolescents. In his opinion, **It appears that personality 
assessment instrimients with more adequately demonstrated 
reliability and validity than Is available so far for the 
Welsh Figure Preference Test would be more appropriate for use 
by high school guidance personnel” (Harris, 1961, p, 74), 



The Deviation Hsrpothesls and the WPPT 

The "deviation hypothesis”, as formulated by Berg (1957) t 
states that: "Deviant response patterns tend to be general; 
hence those deviant behavior patterns which are significant 
for abnormality (atypicalness) and thus are regarded as symp- 
toms (earmarks or signs) are associated with other deviant 
response patterns which are in non-crltlcal areas of behavior 
and which are not regarded as symptoms of personality aberra- 
tion (nor as symptoms, signs, earmarks)” (p. 159). In order 
to test Eeig’s hypothesis of the generality of deviant response 
tendencies, Secrest and Jackson (I962) administered a forced- 
choice version of the WPPT, Berg’s Perceptual Reaction Test 
(PRT), and an Independence-Conformity Inventory to 64 female 
and 59 male college students. The scoring procedure was to 
give positive weights to responses which occurred infrequently 
on the three tests. An analysis of the intercorrelations 



26 



amons responses to the three tests gave little support to 
Berg*s deviation hypothesis. Scores on the WFPT-Devlatlon 
Scale had low, but significant, correlations with PBT-Devl- 
atlon scores (r = .19) and with the Independence-Conformity 
Scale (r = .28). 

A contention related to Berg*s deviation hypothesis Is 
that test Item content Is Irrelevant In personality measure- 
ment. In a study designed to test this hypothesis, Normeui 
(1963) ompared responses given by groups of male college 
students to content-lrrelevsuit stimuli with the responses 
which they gave to content-relevant test Items. The predic- 
tor V8u:lables used In the study were personality descriptive 
adjectives (Descriptive Adjective Inventory, or DAI), occu- 
pational titles (Occupational Preference Inventory or OPI), 
and figural drawings (WPPT), and the criterion variables 
were peer ratings on extroversion, agreeableness, dependa- 
bility, emotional stability, and culture. The WPPT was used 
as the content-irrelevant task, while the DAI and OPI were 
viewed as content-relevant. Plve empirical scoring keys were 
devised for each of the three predictors, and scores on thesd 
keys were correlated with the five criterion ratings. The 
findings were that although all five DAI scales were signi- 
ficantly related to the criteria, with only one exception 
the OPI scales had nearly zero correlations with the criteria, 
and all five WPPT scales also correlated essentially zero 
with the criteria. Norman Interprets these findings as 
falling to support Berg*s notion of the Irrelevance of test 
item content. 
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